Universal Access To All Knowledge
Home Donate | Store | Blog | FAQ | Jobs | Volunteer Positions | Contact | Bios | Forums | Projects | Terms, Privacy, & Copyright
Search: Advanced Search
Anonymous User (login or join us)
Upload

Reply to this post | Go Back
View Post [edit]

Poster: schabotte Date: Dec 15, 2008 12:10pm
Forum: texts Subject: How do I take raw jpgs and make a djvu file?

I'm interested in taking some of the raw jpg files in some of the books (basically color plates and their descriptions) and building my own text searchable djvu file from them.

Does anyone know of the steps/tools that make this happen?

Thank you,
Steven

Reply to this post
Reply [edit]

Poster: Administrator, Curator, or StaffNemo_bis Date: Sep 4, 2009 3:41pm
Forum: texts Subject: Re: How do I take raw jpgs and make a djvu file?

Here you can find a guide: http://en.wikisource.org/wiki/Help:DjVu_files (follow the various links).
To embed text is quite difficult with free software, AFAIK: I don't know if there's a OCR software which can do that. Maybe you'll need to create a pdf, first.

Reply to this post
Reply [edit]

Poster: donuil Date: Dec 15, 2008 11:12pm
Forum: texts Subject: Re: How do I take raw jpgs and make a djvu file?

Uncanny! I was just going to ask this. I'd also like to know if there are the necessary tools for Linux (or, at a pinch, Mac) to do this with. Too many books (especially unwieldy, big ones) are available only in PDF, which are quite impossible to use above a certain size. And the zipped packages of .jpg or .jp2 (how does one use jpg2000 anyway?) images are just sitting there laughing. I am not bothered about text-searching.

Every time I've attempted to convert an Internet Archive pdf to djvu using djvu2pdf, the programme crashes after 200 images, which oddly doesn't happen with any other scanned pdf such as Google ones.